-
Notifications
You must be signed in to change notification settings - Fork 919
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a library_design.md file documenting the core Python data structures and their relationship #10817
Add a library_design.md file documenting the core Python data structures and their relationship #10817
Conversation
Codecov Report
@@ Coverage Diff @@
## branch-22.06 #10817 +/- ##
================================================
+ Coverage 86.29% 86.32% +0.02%
================================================
Files 144 144
Lines 22656 22668 +12
================================================
+ Hits 19552 19569 +17
+ Misses 3104 3099 -5
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Really nice job @vyasr. I have a few suggestions attached. Let me know if you want another round of review, but I wouldn't spend more than 1 more round on this. (Our collective time is better spent on porting the rest of the proposed developer docs into Markdown than "perfecting" this document.)
rerun tests |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is awesome work. I love the approach of decomposing the library into three layers and explaining each one! Have a few requests mostly around making the document more approachable to first-time/casual readers.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I like the changes you made. @shwina raises some good questions, but this is green-light from my side.
@shwina this is ready for another round of review. Let me know if you want me to render and show you any part of the docs. |
This is looking good! I'm approving with a few small suggestions. |
@gpucibot merge |
rerun tests |
This PR adds an
library_design.md
file discussing cuDF's internal architecture, including its core data structures, their purpose, and how they related to pandas and libcudf objects. The document is not short, but it aims to avoid being too long by focusing mainly on the layout between classes and how they interact. I do not discuss implementation details for specific functionality (e.g. the Merge or GroupBy classes), nor do I go into detail on the layout of files on disk. The emphasis is on understanding the different principal components and how they fit together.This PR contributes to #6481. Subsequent PRs will focus on other aspects of a developer guide, such as a more information on how to contribute, write tests, benchmark, and write documentation.